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Abstract 

Crowdsourcing offers unprecedented potential for solving tasks efficiently 
by tapping into the skills of large groups of people. However, the salient 



(/3 ' feature of crowdsourcing — its openness of entry makes it vulnerable to 

O . 

malicious behavior. A firm that decides whether or not to resort to crowd- 

^ \ sourcing has to consider the tradeoff between the potential for increased 

00 ' 

■^ ' productivity with the possibility of being set back by malicious behavior. 

in 

en ■ We study this tradeoff in a game where tw^o firms compete against each 

Q \ other to obtain a better solution to a task. We characterize the equilibrium 

cn ■ 

behavior of the firms as a function of the comparative advantage afforded 
by using cro^vdsourcing vs. in-house solution; the vulnerability of crow^d- 

X. . 

\-{ . sourcing to an attack; and the cost of undertaking this attack. Our results 

show that malicious behavior is the norm, not the anomaly, as it tends 
to emerge under a wide variety of conditions — a result contrary to the 
conventional wisdom in the area. Counterintuitively, increasing the cost 
of an attack has no deterring effect on malicious behavior, and favors the 
w^eakest, least productive firm. These findings have cautionary implica- 
tions for the design of crowdsourcing competitions; for corporate decisions 
on w^hether to crow^dsource or not; and ultimately for understanding the 
long-term prospects of open, collective problem solving. 



Numerous successful examples of the power of crowdsourcing to solve problems of extreme 







difficulty |ll-ll4j have overshadowed important episodes where elaborate sabotage derailed 
or severely hindered collective efforts. The winning team in the DARPA Network Challenge 
obtained the locations of the ten balloons after spending significant efforts filtering a ma- 
jority of false submissions, including fabricated pictures containing individuals in disguise 



impersonating DARPA officials 15| . A team from the University of California at San Diego 



lost its lead in the DARPA Shredder Challenge after their puzzle prog ress was completely 
wiped-out by a relentless number of coordinated overnight attacks [l6|, ll7|. The team that 
topped the US Department of State sponsored Tag Challenge had to withstand a smear 
campaign orchestrated in Twitter that may have reduced its credibility [lO| • Beyond crowd- 
sourcing competitions, Ushahidi's collective confiict mapping for the Arab Spring had to be 
ong periods of time due to suspicions that it had been infiltrated by govern- 



shut down for 

ment officials 17||. This fragility of crowdsourcing can also be used in a positive manner: 
targeted attacks can exploit the fragility of decentralized, hierarchical recruitment of on-line 
extremists, effective slowing down their radicalization and training process 18|. 

These episodes have received a response in the form of emerging work at the intersection 
of the computer and economic sciences. Recent results in this area have elucidated that it 
is possible to design incentive structures and algorithmic strategies to modify the efficacy 
and quality of the crowdsourced solution, the vulnerability of crowdsourcing to malicious 



behavior, and the cost of undertaking it 19l-l42|. These three factors can attain a wide 



variety of values depending on the particularities of the problem at hand, the economic 
incentives at stake, as well as the algorithmic platform supporting the collective effort. For 
instance, the combinatorial nature of puzzle-assembly in the DARPA Shredder Challenge 
makes the solution very vulnerable to an attack at a low cost — destroying puzzle progress 
is much easier than creating it. On the other hand, fabricating a false ballon sighting in the 
DARPA Balloon Challenge is costly, as it involves a certain degree of counterintelligence 
skills, more so when individuals posing in false submissions risk identifiability — while at 
the same time, a false submission does not affect the validity of the other submissions. 

In this paper, we propose for the first time a formal analysis to explore the efficacy- 
vulnerability tradeoff of crowdsourcing. We adopt a competitive scenario where two firms 
(players) compete against each other to obtain a better solution to a task which can be 
crowdsourced. Our goal is to understand whether malicious behavior exemplified above is 



the norm or the anomaly, i.e. whether we should expect players to undertake attacks on the 
collective progress of competing players. We also aim to at quantifying whether this mali- 
cious behavior may affect the likelihood of using crowdsourcing as a strategy, and ultimate 
how it impacts the social welfare under collective problem solving. Such an understanding 
this will be helpful for the design of future crowdsourcing competitions, as well as support 
the decision process of institutions and firms considering crowdsourcing. 

MODEL 

We study a non-cooperative situation where two players (or firms) compete to obtain a better 
solution to a given task. The firm with the better solution wins and receives a reward of R. 
Each firm can obtain an in-house solution or crowdsource the task. The former is referred 
to as the closed strategy, the latter — as open. We focus on scenarios where open strategies 
are likely to be more efficient even though the exact level of efficiency is not known until 
after a firm engages in crowdsourcing - this reflects a high level of uncertainty underlying 



engagement processes in social networks |43H50|. Also, open strategies are susceptible to 



attacks: a flrm using an open strategy can be attacked by the competing flrm. The resulting 
damage from the attack may signiflcantly impair the other flrm, letting the attacking flrm 
win. 

In the model, flrms flrst decide whether or not to crowdsource, and then whether or not to 
attack the competitor if she is using an open strategy. Formally, we have a two-stage game. 
In the flrst stage, the flrms decide whether to solve the task in-house S or to crowdsource 
it C. Then the efficiencies of the firms that chose to crowdsource are observed. Note that 
the efficiency, or productivity, of a crowdsourcing strategy used by player i is known to both 
players. We use Pi to denote the productivity random variable and pi its realized value, 
respectively. The productivity of the in-house solution is fixed and normalized to zero. We 
assume that the productivity of a crowdsourcing strategy is uniformly distributed between 
and 1: Pi ~ f/[0, 1]. In the absence of attacks, the firm with a higher productivity wins. 

In the second stage, the players decide whether or not to attack the opponent (attacking 
is denoted by A and not attacking by A^). An attack is costly, and the cost q G (0, 1) is 
expressed as a fraction of the total reward R. This cost can represent a range of situations: 
the human effort in disrupting the opponent's solution; the complexity of creating multiple 



identities to carry out a Sybil attack; some financial punishment received when the attack is 
detected in a crowdsourcing competition. The damage inflicted by the attack is denoted by 
d G (0, 1), which determines how much productivity is taken away from the open strategy 
(equivalently, how much more productive the attacking flrm becomes after "stealing" the 
crowdsourced solution). The flrm that has a higher productivity at the end of the second 
stage wins the prize R, which is normalized to be i? = 1. The parameters q and d are 
publicly known. We characterize the equilibrium of this two-stage game as a function of 
these parameters. 

EQUILIBRIUM ANALYSIS 

We flnd the subgame perfect equilibrium of the two-stage game. Each pair of decisions made 
in the flrst stage results in a different second stage game. We flrst analyze ever pair of second 
stage decisions. Later we describe the payoff matrix for the game played in the flrst stage 
and derive the equilibrium for the overall crowdsourcing game. 

When both players use an in-house strategy S, there is no reason to attack, and they 
both choose A^ in the second stage. Each one is equally likely to win and the expected utility 
of each is |. 

When player 1 crowdsources and player 2 does not, player 1 has no reason to attack, 
but player 2 attacks if doing so puts him ahead of player 1. Player 2 attacks if the realized 
productivity of player 1 is less than the damage pi < d player 2 can inflict. Recall that 
we normalized productivity of an in-house solution to zero. The expected utility of player 
1 before his productivity is realized is 1 — d: i.e., he receives the payoff of 1 when his 
productivity is high enough to not be attacked, which, for uniformly distributed productivity, 
happens with probability 1 — d. The expected utility of player 2 before the productivity of 
player 1 is realized is d{l — q): i.e., the productivity of player 1 is low enough to be overtaken 
after an attack (which happens with probability d), so player 2 attacks and receives the 
reward minus the cost: 1 — q. Note that player 2 attacks for any cost of attack q G [0, 1): 
attacking brings a positive utility while not attacking results in zero utility. The case when 
player 2 crowdsources, and player 1 does not is symmetric. 

The most interesting case is when both players crowdsource. Let pi and p2 denote their 
productivities, which are known before they decide on attacking. Consider the case when 



Pi > p2- The other case is symmetric. 

If the difference in productivities pi — d > p2 oi the players is so large that the attack by 
player 2 does not let her reach player 1, then attacks do not change the outcome and neither 
player attacks. In this case, player 1 receives the utility of 1, while player 2 receives the 
utility of 0. For the case, when the attack of player 2 puts her ahead of the non-attacking 
player 1 (i.e, pi — d < P2), player 2 would like to attack if player 1 does not. At the same 
time, player 1 would like to attack in order to keep its lead only if player 2 is attacking. 
Consequently, there is no pure strategy equilibrium of this game. The payoff matrix showing 
utilities for each player appears below. 
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The game possesses a unique mixed equilibrium. Let h, I2 denote the probabilities that, 
respectively, players 1 and 2 attack. In a mixed equilibrium, player 1 is indifferent between 
choosing either action; the same holds for player 2. Thus, the mixed equilibrium is given by 

l-q=l-l2 

/i(-g) + (l-/i)(l-g)=0 

In this mixed equilibrium, player 1 attacks with probability li = 1 — q, and player 2 attacks 
with probability I2 = q. The higher the cost, the more likely the weak player (i.e., player 
2) is to attack and the less likely is the strong one. This behavior contradicts the intuition 
that making attacks more costly helps prevent them. The reason the intuition does not 
hold for player 2 is explained by the equilibrium strategy of player 1, who attacks less when 
the cost is higher. Whenever both players attack, player 1 wins with certainty, and player 
2 would have preferred to avoid the useless and costly attack. Thus, the less player 1 is 
likely to attack (i.e., the higher the cost), the more eager player 2 is to attack. For example, 
when attacking is free, player 1 always attacks and always wins obtaining the utility of 1. 
When the cost is the same as the prize, Q' = 1, player 2 always attacks taking the prize away 
from player 1 who does not attack resulting in zero utility for both. This leads to another 
counterintuitive result: the higher the cost of attack, the more likely is the weak player to 
win. In other words, instead of protecting the better crowdsourcing strategy, a higher cost 
of attack results in the weaker strategy winning after a unilateral attack. 



The equilibrium utility of the weak player is zero, while the equilibrium utility of the 
strong player is 1 — q'. The expected utility of player 1 (and symmetrically of player 2) when 
both players crowdsource but before they learn their productivities is 

ui = Pr(P2 < Pi < F2 + d){l -q) + Pr(P2 + d < Pi) 

The first term corresponds to the utility of player 1 in the mixed equilibrium of the game 
described above, and the second term corresponds to player 1 winning with certainty when 
player 2 cannot reach him even after attacking. The utility of player 2 is defined symmet- 
rically. Under the assumption that Pi and P2 are uniformly distributed between zero and 
one, we derive 

/"I /"I /"I /•mm(p2+d,l) 

Pr(P2 < Pi < P2 + d) = / I{p2<p^<p2+d}dp2dpi = ldpidp2 
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= / (min(p2 + d,l) - P2)dp2 = / ddp2 + (1 - P2)dp2 
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= (1 - d)d + (1 - 1) - (1 - d - il^) =d-^ 
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The ex-ante utihty of either player is 






Ui = U2 = {d- y)(l -q) + {--d+-)l = --{d- 







The utility of players decreases in both q and d. At the extreme case when both q and 
d are 1, the utility is zero. Whenever either of the parameters is at its minimum value of 
zero, the utility is at its maximum value of |. It is interesting to observe that the expected 
utility decreases in the cost of attack. This result is surprising as it contradicts the intuition 
that a higher cost of attacking leads to fewer attacks and, thus, to higher social welfare 
(that is, to a higher total utility of both players). However, the total rate of attacks does 
not change with q: the weak player attacks with probability q and the strong player attacks 
with probability 1 — g, so in expectation there is one attack for any q. The higher cost of 
attacks decreases the utility of the attacking player, resulting in the lower social welfare. 

iHSJ], the unique equilibrium for both players is to choose the 



In the Prisoner's Dilemma 



M 



action that hurts the other player: i.e., to defect. In the resulting equilibrium, both players 



are hurt. A similar situation (although in mixed strategies) arises in the crowdsourcing 
game. Both players choose to attack (in equilibrium, the expected number of attacks is 1), 
while attacks incur unproductive costs. When both players attack, the outcome is the same 
as when neither player attacks except that each player incurs the cost of attack. 

Having analyzed second stage games, we can now describe the payoff matrix for the 
game played in the first stage. Note that for the first stage, players are unaware of their 
productivities, should they choose crowdsourcing. Thus, the payoff matrix contains their 
expected utilities, depicted in Table I. 
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TABLE I. Expected payoff matrix for the crowdsourcing game. 

When the damage inflicted by an attack is high, c? > |, an in- house solution is an 
equilibrium strategy for both players. CC is also an equilibrium when d and q satisfy 
\ — [d — y)q' > (^(1 — (i) or equivalently q > ^^r^- Intuitively, players are more likely to 
crowdsource when attacking costs a lot (higher q) and is not effective (lower d). For d > ^ 
this occurs for values of q above the line in Figure [H 
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FIG. 1. Minimum q to support CC as an equilibrium for d > ^. 



When c? < |, CC is the only equihbrium for any value of q. As we discussed, regardless 
of g, the expected number of attacks when both firms crowdsource is 1. Thus, attempts to 
prevent attacks by raising the cost of attacking (e.g., making it more difficult) do not work. 

DISCUSSION 

Our results bear resemblance to the Prisoner's Dilemma but paint an even starker picture. 
When both players crowdsource (i.e., choose a more efficient way of performing the task) and 
are close to each other (specifically, within the damage inflicted by attacking, as it was the 
case in the DARPA Network Challenge) in terms of solution quality, the expected number of 
attacks is one regardless of the cost of the attack. Increasing the cost of attack offers no pos- 
sible deterrence. Therefore, under our simplifled model, malicious behavior is the expected 
behavior, not the anomaly, as it tends to emerge under a wide variety of conditions. This 
result is contrary to the conventional wisdom in the area where, despite signiflcant progress 
towards flltering missinformation has been made, discussion about malicious behavior has 
been generally absent. 

In our work, we considered the relative performance of the flrms while ignoring the 
absolute quality of the solution. This explains why the highest expected social welfare 
is obtained when neither flrm crowdsources, and therefore, neither flrm attacks (see the 
payoff of the {S,S) strategy in Table [T]). This is suitable for many competition settings 
where only relative performance is important (such as the aforementioned DARPA Network 
Challenge, DARPA Shredder Challenge, or Tag Challenge). Requiring a certain minimum 
solution quality and modeling the cost of effort are interesting extensions for future work. 
For example, one could model effort as time required to flnd a solution, with the time being 
inversely related to productivity (e.g., t = 1—p). Competitions may also mitigate aggression 
by using reward mechanisms where the reward received by the winner has a dependence on 
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the global progress of all teams - linking crowdsourcing games to Public Good Games 

This would be an avenue to enforce that global progress, irrespectively of who is producing 

it, has an incentive to be preserved by all players. 

Repeated encounters in crowdsourcing competitions may provide opportunities for the 
emergence of a richer set of strategies that can exploit tradeoffs between malicious and non- 



malicious behavior, as is it the case in the Iterated Prisioner's Dilemma 
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-l6l|. It would 



be also interesting to study how the presence of more than two-players affects the strategies 
displayed. We hypothesize that a multi-player setting would further penalize the strongest 
crowdsourcing player — this time favoring the mid-level players instead of the weakest — 
as the stronger player is likely to absorb multiple attacks after her performance is revealed. 

Our results emphasize that despite crowdsourcing being a more efficient way of ac- 
complishing many tasks, it also a less secure approach. In scenarios of "competitive" 
crowdsourcing, where there is an inherent desire to hurt the opponent, attacks on crowd- 
sourcing strategies are essentially unavoidable. We expect these surprising results derived 
in our stylized model to hold in a variety of more complicated scenarios that exhibit the 
fundamental tension between openness, efficiency and vulnerability. 
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